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Data-dependent probability matching 
priors for empirical and related likelihoods 

Rahul Mukerjee 1 
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Abstract: We consider a general class of empirical-type likelihoods and de- 
velop higher order asymptotics with a view to characterizing members thereof 
that allow the existence of possibly data-dependent probability matching pri- 
ors ensuring approximate frequentist validity of posterior quantiles. In partic- 
ular, for the usual empirical likelihood, positive results are obtained. This is 
in contrast with what happens if only data-free priors are entertained. 
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1. Introduction 

Although empirical and related likelihoods have received significant attention (see 
[11, 14] and the references therein), their study from a Bayesian perspective began 
only in recent years. Lazar [10] pioneered an investigation on the validity of the 
empirical likelihood for posterior inference and examined, mostly by simulation, 
the frequentist properties of posterior empirical likelihood intervals. In another 
significant development, Schennach [18] proposed a Bayesian exponentially tilted 
empirical likelihood arising as a nonparamctric limit of a Bayesian procedure which 
places a kind of noninformativc prior on the space of distributions. Starting from 
a general class of empirical-type likelihoods for the population mean, Fang and 
Mukerjee [7] characterized its members which admit probability matching priors in 
the sense of allowing posterior credible sets with approximate frequentist validity. 

Along the line of what is traditionally done in parametric inference based on the 
true likelihood, Fang and Mukerjee [7] entertained only priors that are free from the 
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data. They observed, among other things, that none of the standard empirical- type 
likelihoods that have been proposed and widely studied in the literature, includ- 
ing the usual empirical likelihood, admits a probability matching prior even with 
margin of error o(n -1 / 2 ), where n is the sample size. This is somewhat disappoint- 
ing and, given the popularity of these likelihoods, prompts one to investigate the 
consequences of working with possibly data-dependent priors with the hope that 
this may yield more positive results. The present article aims at exploring this issue 
with reference to the same general class as in [7] . Satisfyingly, it is seen that at least 
for the usual empirical likelihood, positive results then emerge even with margin of 
error o(r7, _1 ). 

Probability matching priors have been studied extensively in parametric infer- 
ence-see, for example, [6, 9, 12], and [20]. A key difference with the parametric 
case is that here we are not working with the true density-based likelihood and, 
as such, a shrinkage argument, suggested originally by J. K. Ghosh to the present 
author, that simplifies the frcquentist calculations there ([6], Ch. 1) is no longer 
applicable. As a result, one has to employ a direct Edgeworth expansion based on 
approximate cumulants. 

While the present work seems to be the first attempt towards exploring the 
higher-order asymptotics on data-dependent priors with empirical-type likelihoods, 
such priors have received considerable attention in recent years in parametric infer- 
ence via the true density-based likelihood. A brief indication of this literature may 
interest the reader. A key reference in this regard is [21], where it was found that 
for certain mixture models, no data-free improper prior yields a proper posterior 
and no data-free proper prior entails frcquentist validity of posterior quantilcs with 
margin of error 0(n _1 ), while both problems are solved by a data-dependent prior. 
Furthermore, such data-dependent priors were shown to approximate data-free pri- 
ors, in addition to enjoying desirable properties like asymptotic minimaxity. Prior 
to [21], data-dependent priors were considered, among others, by [15] and [17] in the 
context of mixture models with an unknown number of components. Sweeting [19] 
investigated the crucial role played by data-dependent probability matching priors 
when the sample size is stochastic, as happens, for instance, with censoring or a 
stopping rule. Reid et al. [16] reviewed and discussed a notion of strong matching 
which requires data-dependent priors. 

In the context of parametric inference, Clarke and Yuan [4] studied partial infor- 
mation reference priors obtained through the maximization of conditional Shannon 
mutual information. These priors are often data-dependent in the sense of involv- 
ing statistics that are associated with nuisance parameters and capture helpful side 
information. The information theoretic interpretation for these priors was also dis- 
cussed at length by [4]. Clarke ([3], Subsection 7.2) discussed data-dependent priors 
in the light of the Freedman-Purves Theorem [8] , which often forms the basis of the 
argument put forward by orthodox Bayesians against such priors on the grounds 
of incoherence. He argued that the implications of this theorem are narrower than 
commonly appreciated, suggested a remedy in the form of a criterion of information 
boundedness, and observed that the data-dependent priors in [4], [16], and [21] are, 
indeed, information bounded. 

The interested reader may refer to the papers cited in the last two paragraphs 
for further references on data-dependent priors in parametric inference. 
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2. A general class of empirical-type likelihoods: posterior quantiles 

Let X\ , . . . , X„ be independent scalar- valued random variables from an unknown 
common distribution with an unknown mean 9. The parameter space for 9 is the 
real line or an open interval thereof. The Xi are supposed to be absolutely con- 
tinuous and the first four population moments are assumed to exist [2]. These 
assumptions justify an Edgeworth expansion used later. Let X ~ n ~ x 5_^" =1 

m s = rc. -1 Xrili(^t ~X) S , s = 2,3,..., g 3 = m 3 jm^ 2 , g A = m 4 /ml and y = 
y{9) = (n/m,2) 1 ^ 2 {9 — X). Write </>(•) for the standard univariate normal density. 
As in [7], we consider a general class of empirical-type likelihoods of the form 

L(0) cx c/ ) (y)[l + n-^ 2 {a 1 (g 3 )y + as(g 3 )y 3 } 

(1) +n^ 1 {b a (g 3 ,g 4 ) + b 2 (g 3 ,g4)y 2 + b 4 (g 3 , g A )y A + b 6 (g 3 , g^y 6 } 
+o p {n~ 1 % 

where the di(-) are polynomials in g 3 and the bi(-) arc polynomials in g 3 and (74, the 
coefficients therein being constants free from n. These polynomials depend on the 
particular likelihood. Note that L(9) depends on 9 through y — y{9) but does not 
involve any other population parameter, and that y is the standard pivotal quantity 
for inference on 9 when the population variance is unknown. Furthermore, the terms 
of order n -1 / 2 and in (1) aim at taking care of the unknown skewness and 
kurtosis of the population via their sample analogs 173 and 174 respectively. Indeed, 
the class (1) is very wide and, as discussed later in Section 4, covers all major 
empirical-type likelihoods proposed in the literature. 

With reference to any likelihood in the class (1), we consider a possibly data- 
dependent prior of the form 

(2) tt(0) =exp{^,m 2 , 53 )}, 

where ip(-) is a smooth function with functional form free from n. We aim at char- 
acterizing the cii(-) and bi(-) in (1) so as to allow the existence of a prior of the 
form (2) that entails frcquentist validity of the posterior quantiles of 9, with mar- 
gin of error o(n -1 / 2 ) or o(n _1 ). The class (2) can be motivated as follows. For any 

empirical- type likelihood as in (1) and a data- free prior, up to the first order of 

_ 1/2 

approximation, the quantity (9 — X)m 2 represents a standardized version of 9 
in the posterior setup; see [7] . This prompts one to consider a data-dependent prior 
of the form 

(3) 7:(9)=eM(0-X)m- 1/2 X (g 3 )}, 

where the multiplier x(ff3) i s a smooth function of g 3 , with functional form free of 
n, that aims at taking care of the population skewness (an attempt to take care of 
the population kurtosis would involve a more elaborate data dependent prior and 
a discussion of this is deferred till Section 5). Clearly, the prior in (3) is equivalent 
to 

(4) n(9) = exp{0m 2 - 1/2 x( ff3 )}, 

because they both lead to the same posterior. The exponent in (4) is a smooth 
function of 9, 777,2 and g 3 , and the class (2) incorporates all priors that share this 
feature. 
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Let ipi{ti,t 2 ,h) = difj(t 1 ,t 2 ,t 3 )/dt i , 4> ij {t 1 ,t 2 ,t 3 ) = d 2 ip(ti,t 2 ,t 3 )/dtidtj, tpi = 
ipi(X,m 2 ,g 3 ) and ipij = i , ij{^i m 2i9z)i hj = 1,2,3. Then, analogously to the 
parametric case ([6], Ch. 2), the posterior density of y = y{9), with reference to (1) 
and under 7r(-) as in (2), can be expressed as 



(5) +n- 1 {R 2 (y 2 - 1) + R 4 (y 4 - 3) + R 6 (y 6 - 15)}] + o p ( 

where X = (Xi, . . . , X n ) and 



Ri = 0-1(93) + ml^ipi, 

R2 = b 2 (g 3 ,g 4 )+ml /2 a 1 (g 3 )4' 1 + ^m 2 (ip 11 +if;l), 

(6) R 3 = 03(53), 

1/2 

Ri = h{93, 9i) +m 2 ' 03(33)^1, 

R(> = b 6 (93,g4)- 

The propriety of the posterior is assumed here. Let 

ui = i?i + R 3 {z 2 + 2), 

(7) u 2 = 2uxzR 3 - -u\z + R 2 z + R 4 (z 3 + 3z) + R 6 (z 5 + 5z 3 + 15z), 

where z is the (1 — a)th quantile of a standard normal variate. As in [7], recalling 
that y = (n/m 2 ) 1 ^ 2 (9 — X), then it follows from (5) that the (1 — a)th posterior 
quantile of 9 can be approximated by 

(8) 9[ 1 - a) (7T,X)^X + ( m2 /n) 1 ^ 2 (z + n- 1 / 2 u 1 ), 
or 

(9) 9 { 2 1 - a) {TT 7 X) = X + {m 2 /n) 1 / 2 {z + n- 1 / 2 u 1 +n- 1 u 2 ), 
with posterior coverage error o p (n -1 / 2 ) or o p (n~ 1 ) respectively. 

3. Prequentist coverage 

3. 1 . Calculation of frequentist coverage 

We next study the frequentist coverage of the interval (—00, 9^ (-k, X)] . The steps 
are similar to those in [7] but more involved because of possible data-dependence 
of the prior; for instance, additional terms appear in the expressions for Wi and k 2 
in (15) and (21) below. 

With P representing the frequentist probability, by (9) and the definition of y, 
the frequentist coverage is given by 

(10) P{9 < 9 < 2 1 ~ a) (ir,X)} = P(y < z + n - 1/2 u 4 + n- l u 2 ). 



In order to obtain an expression for the above with margin of error o(n 1 ) , we need 
stochastic expansions for y, u 4 and u 2 . To this end, let E denote expectation for 
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fixed 6 and write a 2 = E{X, - 9) 2 , Z, = {X, - 9)/a, (3 S = E{Z?), 1 < s < 4, and 
A s = n- 1 ' 2 YZ=i(Z' ~ M, s = 1, 2, 3. Then, as in [7], 

(11) y = -A, + ^n- 1 ' 2 A 1 A 2 - n^^Af + ^A,A 2 2 ) + o^n" 1 ). 

Turning next to u\ and u 2 , we note from (7) that the randomness of these quantities 
is only due to the Ri. For each i, let Rio be obtained from Ri in (6) replacing X, 
m2, 53 and therein by the corresponding population parameters 9, a 2 , /3 3 and 
/34 respectively, i.e., 

Rio = ai(/? 3 )+a<\ 

i? 2 o = h(p3,fa) + (r ai (f3 3 )4 0) + \a 2 [i^ + {4 0) } 2 ], 

(12) R 30 = a 3 (/3 3 ), 

-^60 = b 6 (/3 3 , fa), 

where - ^(0, a 2 , ft), ^ = iMj(9,a 2 ,fa), i,j = 1,2,3. Since 

(13) X = 6» + n- 1/2 crAi, m 2 = ct 2 (1 + n~ 1/2 ,4 2 ) +o p (n~ 1/2 ), 
and 

(14) .g 3 = /33+«~ 1/2 (A 3 -3A 1 -^/3 3J 4 2 ) + 0p (n- 1 / 2 ), ff4 = & + o p (l), 

from (6) we get Ri = R M + n~ 1 / 2 W i + o p (n -1 / 2 ), i = 1,3, and R t = R iQ + o p (l), 
i = 2,4, 6, where 

W 1 = a 2 ^A 1+ a{ l -4^ + a 2 ^}A 2 

(15) +WM + ^KAs - 3A! - ^/3 3 A 2 ), 

(16) W 3 = 4(03)(4j-34i-!&A 2 ), 

and a^(-) is the derivative of fli(-). From (7), it is now evident that 

Mi = ii 10 + n" 1/2 {PFi + W 3 (z 2 + 2)} + o p (n- 1/2 ),u 2 = u 20 + o p (l), 
where the leading terms 

(17) uio=i?io+ii 3 o0 2 + 2), 
and 

(18) U20 = 2u w zR 30 - -u? z + i? 20 z + R 4 o(z 3 + 3z) + R 60 (z 5 + 5z 3 + 15z), 

are simply counterparts of (7) with the Ri there replaced by the RiQ. 

From (10) and the stochastic expansions for u\ and u 2 as indicated above, 

(19) P{9 < 6»^ q) (tt, X)} = P(y<z + n _1 / a u 10 + n" 1 ^) + ofa" 1 ), 
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where y = y — n~ 1 {W\ + W 3 (z 2 + 2)}. By (11), (15) and (16), the first four approx- 
imate cumulants of y are given by 

K Xn = n-^h+oin- 1 ), 
K 2n = 1 + n~ 1 k 2 +o(n~ 1 ), 
K 3n = n-V 2 k 3 + oin- 1 ), 
= n~ ki + o(n _1 ), 

where 

(20) h = l-P 3 , fc 3 = 2/3 3 , fc 4 = 12 + 12/? 3 2 - 2/3 4 



2' 



and 



+- -.Pi + 2{oi(/%) + a' 3 (l3 3 )(z 2 + 2) + - 3 - \p 2 ) 

1 



2,/,(o), r.a „A „,,(o) , * ,(oy 



(21) +2a^>+2P 3 a{-^ , +^}- 

The fact that Wi and W3 are linear in the Ai facilitates the derivation of (20) and 

(21) . From (19), consideration of an Edgcworth expansion for y now yields 

(22) P{6 < ^" Q) (^, X)} = 1 - a + {n-^Ai + n' 1 A 2 )cj)(z) + o^" 1 ), 
with 

(23) A a = u w -k l -\k 3 {z 2 -1) 

6 



and 



m 2 o - \u 2 w z + zu w {k x + \k 3 {z 2 - 3)} - \{k 2 + k\)?. 



(24) -(^fc 4 + ifcifc 3 )(^ - 3z) - ^k 2 3 (z 5 - 10z 3 + 15*). 
3.2. Probability matching conditions 

The frequentist coverage in (22) equals 1 — a + o(n~ 1/l2 ) if and only if Ai =0 
identically in z and the population parameters. Since by (12), (17), (20) and (23), 

Ai = ai(&) + 2a 3 (P 3 ) + <y^f ] - ift + {a 3 (/3 3 ) - ^/3 3 }z 2 , 

recalling the definition of ip^' , it is clear that the above happens if and only if 

(25) a 3 ((3 3 ) = ^P 3 and ^(0, a 2 , fo) = h(a 2 , fa) - Oa' 1 { ai ((3 3 ) + ift}, 

h(-) being any smooth function of a 2 and [3 3 . Note that the first condition in (25) 
is on the empirical-type likelihood whereas the second condition concerns the prior. 
Indeed, with tp(-) as in (25), it is easily seen from (2) that the specific choice of h(-) 
has no influence on the posterior. Hence, hereafter, we take h(a 2 , (3 3 ) = 0, i.e., 

(26) tt(0) = cxp[-0m 2 - 1/2 { ai (. g3 ) + \gz}}, 
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by (2) and (25), and continue with (25) to obtain further conditions that arise when 
one wishes to work with margin of error o(n _1 ). Observe that the prior in (26) is 
actually of the form (4) that motivated the class (2) of priors. From (12), (17), 
(18), (20), (21) and (24), after considerable algebra, it can be seen that under (25), 
A 2 = dz + C 3 z 3 + C 5 z 5 , where 

Ci = b 2 (^,l3 i )+3b i (Ps,l3 4 ) + 15b 6 (l3 3 ,0 i )-UaM} 2 -/3 3 a 1 (f3 3 ) 



~Wa@3 ~ 777^4 + 77j 

1 „ 2 „ 1 1 



24' 3 12"" ' 2' 



C 3 = 6 4 (/3 3 , At) + 56 6 (/3 3 , /3 4 ) - + ~ f A + ^ . 

C 5 = 6 6 (/3 3 ,/3 4 )-^3 2 - 

Thus, under (25), the frequentist coverage in (22) equals 1 — a + o(n ) for every z 
and every possible 6, a 2 , f3 3 and /3 4 if and only if C±, C 3 and C5 vanish identically 
in the population parameters, i.e., if and only if 

M/?3,/3 4 ) = ^{ai(/3 3 )} 2 + ^ 3 2 -5/3 4 + l, 
(27) M/3 3 ,&) = \foa 1 {fo)- l -[3i + - A [5 i - l - 1 

The conditions in (27) are again on the likelihood. 



4. Implications 

We now examine some major subclasses of empirical-type likelihoods in the light 
of the conditions obtained in the last section. These are (i) likelihoods arising from 
empirical discrepancy statistics ([5], Section 1) and hence from Cressie-Read dis- 
crepancy statistics [1], (ii) generalized empirical likelihoods [13], and (iii) generalized 
empirical exponential family likelihoods ([ ], Section 4). As noted in [7] all these 
belong to the general class (1). Moreover, it can be shown that the forms of the dj(-) 
and bi(-) for the subclasses (i)-(iii) are as in Tabic 1. In this tabic, r 3 , r 4 , 73, 7 4 
and n are constants that depend on the particular likelihood. The usual empirical 
likelihood belongs to each of (i)-(iii) with 

a i(ff3) = 0, 03(33)^^.93 
(28) b (g 3 ,g 4 ) = b 2 {g 3 ,g 4 ) = 7 

M33,.9 4 ) = - + !)> b 6 (g 3 ,g 4 ) = —g%, 

while Schennach's Bayesian exponentially tilted empirical likelihood [18] belongs 
to (iii) with fj, = |. From (25) and Table 1, it is clear that each likelihood in the 
subclass (iii) admits, with margin of error o(rt -1 / 2 ), a data-dependent probability 
matching prior of the form (2) for posterior quantiles. Since each of these likelihoods 
has (ii(g 3 ) = 0, by (26), such a prior is given by 



(29) 



7r(0) = cxp(--6>m 2 1/2 g 3 ). 
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Table 1 

Forms of the ai(-) and bi(-) for the subclasses (i)-(iii) 

Subclass 01(93) 0-3(93) 00(93,94) 02(93,94) 04(93,94) &6(93,34) 

(i) r 49 4-fr^(^ + l) 

(ii) 7393 7494 - §7393 ~ 3 73 + | §7393 

(iii) j?93 W4 -(^+l)( g | + l) Xg| 



Table 2 

Simulation results on the frequentist coverage of (— 00, 0J 1 a '(7r, X)] /or generalized empirical 
exponential family likelihoods, with tt(9) as in (29) 
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b size 








SampL 


b size 




Distribution 


1 - a 


8 


12 


16 


20 


1 - a 


8 


12 


16 


20 


Normal(0,l) 


0.95 


0.912 


0.928 


0.933 


0.938 


0.10 


0.138 


0.123 


0.119 


0.114 




0.90 


0.863 


0.877 


0.884 


0.886 


0.05 


0.088 


0.074 


0.069 


0.064 


Uniform(0,l) 


0.95 


0.934 


0.944 


0.946 


0.949 


0.10 


0.112 


0.106 


0.102 


0.102 




0.90 


0.887 


0.896 


0.897 


0.898 


0.05 


0.067 


0.056 


0.051 


0.051 


Bcta(l,2) 


0.95 


0.910 


0.928 


0.936 


0.938 


0.10 


0.110 


0.108 


0.106 


0.104 




0.90 


0.861 


0.880 


0.888 


0.890 


0.05 


0.061 


0.055 


0.055 


0.054 


Exponential(l) 


0.95 


0.850 


0.878 


0.898 


0.906 


0.10 


0.111 


0.113 


0.111 


0.111 




0.90 


0.798 


0.827 


0.845 


0.854 


0.05 


0.063 


0.064 


0.063 


0.061 


Raylcigh(l) 


0.95 


0.900 


0.918 


0.928 


0.931 


0.10 


0.119 


0.113 


0.111 


0.108 




0.90 


0.849 


0.868 


0.878 


0.880 


0.05 


0.070 


0.064 


0.060 


0.056 



From Table 1, it is also clear that none of the likelihoods in the subclasses (i)-(iii) 
meets the first condition in (27) because ai(-) = &2(-) = for any such likelihood. 
Thus, even with possibly data-dependent priors of the form (2), none of them allows 
frequentist validity of posterior quantiles with margin of error o(n _1 ). In Section 5, 
it will be seen that at least for the usual empirical likelihood this difficulty can be 
resolved by considering more elaborate data-dependent priors. 

Before addressing this issue, we present some simulation results to indicate the 
finite sample implications of the aforesaid probability matching property of the prior 
(29) for the subclass (iii) of generalized empirical exponential family likelihoods. 
Since this matching holds with margin of error o(n -1 / 2 ), it makes sense to study 
the simulated coverage of the interval (— 00,8^ (ir, X)] in this context, where 
0^ a ^ (ir, X) approximates the (1 — a)th posterior quantile of with coverage error 
o p (n~ 1 / 2 ); sec (8). For any likelihood in (iii), it can be seen from (6), (7), (8) 
and Table 1 that (tt , X) = X + (m 2 /n)^ 2 {z + in- 1 / 2 53 (2z 2 + 1)}, under 

(29). The simulation results, each based on 10000 simulations arc presented in 
Table 2. Five distributions for the population along with four choices of 1 — a, 
namely 1 — a=0.95, 0.90, 0.10 and 0.05, are considered. In all cases, except for the 
exponential distribution in the right tail, the convergence to the desired frequentist 
coverage turns out to be reasonably fast. Thus the asymptotic results, even with 
margin of error o(n -1 / 2 ), arc well-reflected in finite samples. 



5. More elaborate data-dependent priors 

Observe that the prior in (26) is equivalent to 



(30) 



tt(0) = exp[-(0 - X) m - 1/2 { ai (g 3 ) + -a,}], 
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in the sense that they both lead to the same posterior; cf. (3) and (4). This motivates 
us to consider possibly data-dependent priors of the form 

(31) tt(0) = cxp{(0 - X)m- 1/2 X (g 3 ) + 1(0 - Xfm^X(g 3 , gi )}, 

where %(•) and A(-) are smooth functions. Since (31) can possibly involve X and <?4 
in addition to 77i 2 and g 3 , it is more elaborate than (2). The introduction of A(-) in 

(31) aims at taking care of the population kurtosis. 

With reference to any empirical- type likelihood in the general class (1), if one 
considers the posterior quantiles of under (31), then algebra similar to but heavier 
than that in Sections 2 and 3 reveals the following: 

(a) Frcqucntist validity of the posterior quantiles holds with margin of error 
o(n -1 / 2 ) if and only if 

(32) a 3 (ft) = 1ft and x (ft) = ~{ai(ft) + 1ft}. 

(b) Frequentist validity of the posterior quantiles holds with margin of error 
o(n _1 ) if and only if in addition 

&4(/%, ft) = l/Wft) " \PI + jft - 5. 

(33) Mft,ft) = ^ft 2 , 
A(ft, ft) = {ai(ft)} 2 - 26 2 (ft, ft) + 1ft 2 - 1ft + 2. 

A comparison between (25), (27) and (32), (33) shows that the last condition 
in (33) is new and this helps. The first condition in (32) as well as the first two 
conditions in (33) are on the empirical- type likelihood. From Table 1, it can be 
seen that any likelihood in the subclasses (i)-(iii) meets these three conditions if 
and only if the associated a^(-) and bi(-) are given by (28), which corresponds to 
the usual empirical likelihood. Furthermore, if (28) holds then the last conditions 
in (32) and (33) yield x(ft) = -5ft and A(ft,ft) = fft 2 - fft + 2, so that by 
(31), the data-dependent prior 

(34) tt(0) = exp{-l(0 - X)m- 1/2 g 3 + 1(0 - Xfm^gl - \ 9i + 2)}, 

ensures frcqucntist validity of the posterior quantiles with margin of error o(n _1 ). 

A comparison between the prior just obtained and the one shown in (29) is in 
order. The one in (29) leads to probability matching to a lower order of accuracy 
but at the same time enjoys the merit of being much simpler. Moreover, as the 
simulation results reveal, it performs quite well in finite samples. Thus a choice 
between the two is essentially a matter of taste. If one wishes to work with a simple 
prior then the one in (29) is recommended. On the other hand, if a premium is put 
on higher order accuracy from asymptotic considerations, then the one obtained in 
this section appears to be more attractive. 

The connection with a result in [7] is worth noting at this stage. They worked 
with data- free priors and showed that a likelihood in the general class (1) admits a 
probability matching prior, with margin of error o(n _1 ), for posterior quantiles if 
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and only if 

With the ai(-) and fej(-) as above, the conditions in (32) and (33) are met if and 
only if xifa) = A(/?3, /?4) = 0. In conjunction with (31), one thus gets the flat prior 
and this agrees with the findings in [7]. 

6. Concluding remarks 

The results in Subsection 3.2 show that if a 3 ((3 3 ) = ^/3 3 then the prior in (26) 
ensures frequentist validity of posterior quantilcs with margin of error o(n -1 / 2 ). 
It is satisfying to note that under the present assumption on the existence of the 
first four population moments, this margin of error is actually 0(n _1 ); vide (22). 
Similarly, if the existence of the first five population moments is assumed, then 
under the prior in (34), the frequentist validity of posterior quantiles arising from 
the usual empirical likelihood holds actually with margin of error 0(n -3 / 2 ). 

By (13) and (14), the data-dependent probability matching prior in (34) satisfies 
tt{9) = 1 + o p (l) . The same holds for the prior in (30) which is equivalent to the 
probability matching prior in (26) in the sense of yielding the same posterior. Thus 
these data-dependent priors approximate the flat prior which is a natural data-free 
prior in the present context; cf. [7] and a well-known result for fully parametric 
location models ([6], Chapter 2). 

There is scope for extending the present results in several directions. For exam- 
ple, if instead of posterior quantiles, interest lies in the highest posterior density 
regions, then the findings in [7] show that none of the standard empirical- type like- 
lihoods proposed in the literature, including the usual empirical likelihood, admits 
a data-free probability matching prior in a higher order asymptotic sense. A nat- 
ural question is whether consideration of data-dependent priors, such as those of 
the form (31), can yield more positive results. Another important issue concerns 
the role of data-dependent priors in the multivariate case with vector 6. Then the 
algebra will be rather complicated because the empirical-type likelihoods as well 
as the data-dependent priors will involve multivariate pure and mixed moments in 
place of 7712, 173 and (74, and one would need to consider multivariate Edgeworth 
expansions for the frequentist calculations. It is hoped that the present work will 
generate interest in these directions. 
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